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METHODS AND APPARATUS FOR 
FIBRE CHANNEL FRAME DELIVERY 

Cross-reference to Related Applications 

The present application is related to concurrently filed U.S. Patent 
Application No. 10/114,394 (Attorney Docket No. ANDIP009) by Scott S. Lee and 
Dinesh G. Dutt and titled Label Switching In Fibre Channel Networks, the entirety 
of which is incorporated by reference for all purposes. 

Background of the Invention 

1. Field of the Invention. 

The present invention relates to fibre channel networks. More specifically, 
the present invention relates to methods and apparatus for providing in order 
delivery of fibre channel frames in a fibre channel network during circumstances 
such as changes in link state or changes in channel state. 

2. Description of Related Art 

Many conventional network protocols allow the out of order delivery of a 
packet sequence. A network node in a TCP/DP based network can receive an out of 
order set of packets and reorder the packets upon receipt. Packets often arrive out of 
order if they travel along different paths to reach a destination. 

However, some fibre channel devices such as disks, disk arrays, and other 
storage mechanisms can not handle out of order frames. Link and channel state 
changes are some of the circumstances that may cause out of order delivery of 
frames in a fibre channel fabric. Multiple links seen as a single link between two 
fibre channel entities is referred to herein as a channel. Some mechanisms in 
existing networks call for the flushing of all frames in the network upon a change in 
link state. Flushing all of the frames can prevent out of order delivery when paths 



1 



WO 03/085910 



PC1YUS03/09328 



and routes change in a network. All frames are flushed even if the paths for the 
frames to the associated destinations are not changed. However, flushing all of the 
frames either explicitly or implicitly can he very disruptive to network operation, as 
more frames are dropped than is necessary and network operation is at least 
5 temporarily halted. 

It is therefore desirable to provide methods and apparatus for improving 
fibre channel frame delivery and providing in order delivery particularly during link 
state and channel changes. 

10 Summary of the Invention 

Methods and apparatus are provided for improving fibre channel frame 
delivery. Techniques are provided for the in order delivery of frames by 
intelligently delaying or dropping selected fibre channel frames. Other techniques 
15 are provided for in order delivery by using label switching and frame labels. The 
various techniques can be applied during circumstances such as a link or channel 
state change. 

According to various embodiments, a method for selectively delivering fibre 
20 channel frames in a fibre channel fabric is provided. A set of next hops at a fibre 
channel entity is identified. The set of next hops is used to forward frames based on 
a destination identifier. A fibre channel fabric link change is detected. An updated 
set of next hops is identified. The updated set of next hops is used to forward a 
frame received at a fibre channel entity based on a destination identifier while 
25 accounting for the fibre channel fabric link change. The set of next hops is 
compared to the updated set of next hops. The forwarding of frames towards the 
updated set of next hops is prevented for a predetermined period of time if it is 
determined that the set of next hops is different from the updated set of next hops. 

30 According to another embodiment, a method for selectively delivering 

frames in a fibre channel fabric is provided. A channel for forwarding frames from 
a first fibre channel entity to a second fibre channel entity is identified. The channel 
comprises a plurality of links connecting the first fibre channel entity to the second 
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fibre channel entity. A change in the channel is detected at the first fibre channel 
entity. An updated channel for forwarding frames from the first fibre channel entity 
to the second fibre channel entity is identified. The updated channel is different 
from the channel. Frames received for forwarding on the updated channel that have 
5 not yet been placed into output queues associated with the updated channel are 
blocked. 

In yet another embodiment, a method for selectively delivering frames in a 
fibre channel fabric is provided. A fibre channel fabric link change is detected at a 

10 fibre channel switch. An updated routing table associated with a topology version 
number is generated. Generating an updated routing table includes determining a 
next hop, an incoming label, and a destination corresponding to each entry in the 
updated routing table. A frame is received at the fibre channel switch. The frame 
includes a first destination and a first label corresponding to a first entry in the 

15 updated routing table. It is determined whether the fibre channel switch has 
received a first outgoing label corresponding to the first entry in the updated routing 
table, the received first outgoing label having the same topology version number as 
the updated routing table. The frame is dropped if it is determined that the fibre 
channel switch has not received the first outgoing label. 

20 

These and other features and advantages of the present invention will be 
presented in more detail in the following specification of the invention and the 
accompanying figures, which illustrate by way of example the principles of the 
invention. 

25 Brief Description of the Drawings 

The invention may best be understood by reference to the following 
description taken in conjunction with the accompanying drawings, which are 
illustrative of specific embodiments of the present invention. 

30 

Figure 1 is a diagrammatic representation of a network that can use the 
techniques of the present invention. 
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Figure 2 is a diagrammatic representation of a fibre channel fabric 
undergoing a fibre channel fabric link change. 

Figure 3A is a diagrammatic representation of a routing table showing a set 

of next hops 

5 Figure 3B is a diagrammatic representation of a routing table showing an 

updated set of next hops. 

Figure 4 is a diagrammatic representation showing virtual output queues. 
Figure 5 is a process flow diagram showing the blocking of fibre channel 

frames. 

10 Figure 6 is a process flow diagram showing the dropping of fibre channel 

frames. 

Figure 7 is a diagrammatic representation of possible reordering at a 
channel. 

Figure 8A is a diagrammatic representation of a forwarding channel table. 
15 Figure 8B is a diagrammatic representation of an updated forwarding 

channel table. 

Figure 9 is a process flow diagram showing the forwarding of fibre channel 
frames upon a change in the channel. 

Figure 10 is a diagrammatic representation of label switching routers. 
20 Figure 1 1 is a diagrammatic representation of label switching routers during 

a link change. 

Figure 12 is a diagrammatic representation of label switching routers while 

out labels are partially resolved. 

Figure 13 is a process flow diagram showing techniques for in order delivery 

25 using input and outgoing labels. 

Detailed Description of Specific Embodiments 

Reference will now be made in detail to some specific embodiments of the 
invention including the best modes contemplated by the inventors for carrying out 
30 the invention. Examples of these specific embodiments are illustrated in the 
accompanying drawings. While the invention is described in conjunction with these 
specific embodiments, it will be understood that it is not intended to limit the 
invention to the described embodiments. On the contrary, it is intended to cover 
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alternatives, modifications, and equivalents as may be included within the spirit and 
scope of the invention as defined by the appended claims. 

Methods and apparatus of the present invention provide for in order delivery 
5 of fibre channel frames. According to various embodiments, several network 
conditions can lead to the out of order delivery of frames to fibre channel device. 
The techniques of the present invention provide for delaying, blocking, dropping, 
and/or labeling certain fibre channel frames to deliver frames to a fibre channel 
device in order. In one embodiment, frames that will be traversing a new path are 
10 blocked to allow frames travelling along the old path to either reach the destination 
first or be dropped from the network. 

Figure 1 is a diagrammatic representation of one example of a network that 
can use the techniques of the present invention. Figure 1 shows a storage area 

15 network implemented using fibre channel. A switch 101 is coupled to switches 103 
and 105 as well as to a host 1 1 1 and storage 121 . In one embodiment, host 1 1 1 is a 
server or client system while storage 121 is any storage subsystem such as a single 
disk or a redundant array of independent disks (RAID). Switch 105 is coupled to 
switch 107. Switch 107 is connected to host 113 and switch 103 is connected to 

20 storage 123. Switch 109 is connected to host 115, switch 107, host 153, and an 
external network 151 that may or may not use fibre channel. In order for a host 111 
to access network 151, a path going through switch 105 can be used. It should be 
noted that any apparatus including a processor, memory, and a connection to a fibre 
channel fabric can be a fibre channel switch. 

25 

Ports used to connect switches to each other in a fibre channel network are 
referred to herein as non F-ports while ports used to connect a switch to a host a 
referred to herein as F-ports. In one example, non F-ports are used to connect 
switch 105 to switch 107 while F-poits are used to connect switch 107 to host 113. 
30 Similarly, FL-ports are used to connect switch 103 to storage 123. Ports such as F- 
ports and FL-pprts are herein referred to as edge ports. Other ports are referred to as 
non-edge ports. 



5 



WO 03/085910 



PCT/LS03/09328 



According to various embodiments, a frame transmitted from host 111 to a 
network 151 or to a storage device 153 includes parameters such as the exchange 
identifier, a sequence, and a sequence number. The exchange identifier can provide 
information on what exchange the frame belongs to. The sequence can provide 
5 information on what portion of the exchange the frame belongs to while the 
sequence number can provide information on how the frames should be ordered. 
Sequence numbers can be used to allow for in order delivery of fibre channel 
frames. 

10 Some fibre channel devices such as certain storage disks and disk arrays 

require that frames be received in the order in which they were transmitted. 
Conventional networks such as TCP/IP networks do not have such a requirement, as 
TCP/IP networks generally have mechanisms for reordering packets upon receipt. 
If frames with sequence numbers of 191, 192, and 193 are transmitted in order in a 

15 fibre channel network, a fibre channel device receiving the frames may expect that 
the frames are in the same order in which they were transmitted. A fibre channel 
device may not be able to handle receiving the frames out of order. 

In a static fibre channel network, frames will typically be received in the 
20 order in which they were transmitted. However, several occurrences can lead to the 
out of order delivery of fibre channel frames. Links state changes in particular can 
lead to out of order delivery. 

Figure 2 is a diagrammatic representation of a fibre channel fabric 
25 undergoing a fibre channel fabric link change. Figure 2 shows one example of a 
link change that can lead to out of order delivery of fibre channel frames. A new 
link using non edge-ports is introduced between switch 103 and switch 107. With 
the introduction of a new link between switch 103 and switch 107, a new version of 
the routing tables can be generated. A variety of routing table generation algorithms 
30 such as Fibre Channel Shortest Path First (FSPF) can be used. Traffic from host 
111 travelling through switches 101, 105, and 107 to switch 109 now can travel 
through switches 101, 103, and 107 to switch 109. The set of next hops available to 
transmit frames from a switch 111 to a storage device 153 is switch 105 before the 
introduction of the new link. 

6 
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A set of neighboring fibre channel entities available to transmit a frame from 
one fibre channel entity to another fibre channel entity is referred to herein as a set 
of next hops. After a link change, a set of next hops can be updated in a switch. In 

5 one example, the set of next hops for transmitting frames from switch 101 to a 
network 151 changes from simply switch 105 to both switch 103 and switch 105 
after link state change. A set of neighboring entities that can be used to transmit 
frames from a source to a destination updated after a link change or the generation 
of updated routing tables is referred to herein as an updated set of next hops. It 

10 should be noted that a set of next hops can include one or more neighboring nodes. 
In one example, a set of next hops is a single neighboring entity. In another 
example, a set of next hops includes multiple neighboring entities. 

Having an updated set of next hops can lead to out of order delivery of fibre 
15 channel frames. In one example, the earlier frames transmitted at switch 101 in a 
sequence may be travelling through switch 105 while later frames in the same 
sequence may be travelling through a switch 103. A variety of network conditions 
can cause the later frames travelling through a switch 103 to arrive at switch 109 
before the earlier frames travelling through a switch 105 arrive at switch 109. In 
20 one example, the earlier frames are slowed at a switch 105 because of congestion at- 
switch 105 while the later frames travel through switch 103 quickly because of a 
new high bandwidth link between switch 103 and switch 107. A storage device 153 
receiving the later transmitted frames before receiving the earlier transmitted frames 
from host 111 may not be able to handle the out of order frames. 

25 

Figures 3 A and 3B depict routing tables showing information relating to sets 
of next hops and updated sets of next hops at a switch 101. Figure 3A shows sets of 
next hops for a frame received at switch 101 and a network where a link has not yet 
been established between switch 103 and switch 107. When a frame is received at 
30 switch 101, an identifier depicting the destination of the frame can be used to 
reference an entry in the routing table. In one example, the destination of the frame 
is switch 107 and the entry 309 can be referenced to determine that the set of next 
hops is switch 105. If it is determined that the destination of the received frame is 
switch 101, the routing table can be used to pass the frame to a processor associated 



WO 03/085910 



PCT/LS03/0.9328 



with switch 101. A routing table can also be used to drop frames. In one example, 
a value such as a null value can be placed in the set of next hops and a frame having 
a destination associated with the null value can be dropped upon referencing a 
routing table. An entry directing that a frame with a particular destination be 
5 dropped is referred to herein as an adjacency drop. 

In one embodiment, a routing table is provided for each virtual storage area 
network (VSAN) that the switch is a part of It should be noted that a fibre channel 
switch can be a part of many different VSANs and a routing table can be provided 
1 0 for each VSAN the switch is associated with. 

After a link is added connecting switch 103 to switch 107, the routing table 
is updated. Figure 3B is a diagrammatic representation of a routing table showing 
updated sets of next hops. According to various embodiments, a frame having a 
15 destination set as switch 107 can be forwarded along either switch 103 or switch 
105 based on entry 329. In a stable topology, all frames in a particular flow or 
exchange follow the same path. 

It should be noted that a routing table may allow both paths or it can select a 
20 best path. If the best path selected has a next hop of switch 105 the updated set of 
next hops is the same as the set of next hops before the link change as shown in 
Figure 3A. If the path selected is switch 103 for a frame with a destination of 
switch 107, the updated set of next hops is different from the set of next hops in the 
routing table before the link change. Determining whether an updated set of next 
25 hops is different from an original set of next hops can be useful for deciding on 
whether to block or drop a particular frame. In one example, frames are not blocked 
or dropped if the path for a particular sequence of frames remains unchanged even 
after a link change in the fibre channel fabric. If the path for the sequence of frames 
remains the same, the frames will be delivered in order to a destination. If the path 
30 for the sequence of frames has changed, there is risk that frames will be delivered 
out of order. 

One of the mechanisms that can impact the order in which fibre channel 
frames are delivered are the queues within a fibre channel switch. A frame 
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transmitted first from a host can remain in a queue associated with a switch 105 
while a frame transmitted later from a host can be delivered quickly through a 
switch 103. Figure 4 is a diagrammatic representation of queues that can be 
associated with a fibre channel switch, according to various embodiments. 
5 Although one particular type of queue will be described, it should be noted that a 
variety of different input and output queues associated with various input and output 
ports can be used to implement the techniques of the present invention. 

A switch 401 is connected to external nodes 451, 453, 455, and 457. The 
10 switch 401 includes a buffer 403 of shared memory associated with each switch 
port. A buffer 403 is associated with external node 451. Buffers associated with 
external nodes 453, 455, and 457 are not shown for purposes of clarity. The buffer 
403 can hold traffic destined for external nodes 453, 455, 457, and loop back traffic 
to external node 451. 

15 

In typical implementations, frames destined for the various external nodes 
are all placed in the same buffer 403. Consequently, when a switch 401 receives a 
large volume of frames destined for a particular node such as external node 453, 
frames associated with external node 453 can use the entire buffer 403. According 
20 to various embodiments, the frames stored in buffer 403 are referenced by pointers 
in frame descriptor queues 411-447. Each frame descriptor can contain a pointer or 
reference identifying where the frame is stored in the buffer 403. Pointers or 
references to a shared buffer are herein referred to as descriptors. Descriptors can 
also identify other information such as frame priority. 

25 

In one example, an arbitrator 405 selects frames using a round-robin 
methodology. In a first round, a frame destined for external node 453 is selected. 
In a second round, a frame destined for external node 455 is selected, etc. More 
particularly, the arbitrator 405 may first select a high priority frame associated with 
30 descriptor 411 destined for external node 453, then select a high priority frame 
associated with descriptor 421 destined for external node 455, then select a high 
priority frame associated with descriptor 431 destined for external node 457, etc. It 
should be noted that a variety of techniques for selecting a frame can be used, as 
will be appreciated by one of skill in the art. 
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A queuing system having buffers apportioned based on destination is 
referred to herein as virtual output queuing (VOQ). VOQ is described further in 
Tamir Y., Frazier G.: "High Performance multi-queue buffers for VLSI 

5 communications switches", Proc. Of 15th Ann. Symp. On Comp. Arch., pp.343- 
354, June 1988, the entirety of which is incorporated by reference for all purposes. 
An abstraction identifying traffic with particular characteristics between two nodes 
is herein referred to as a flow. In one example, a' flow is referenced by a source 
identifier, a destination identifier, a priority, a class, and an exchange identifier. 

10 Other characteristics are also possible. It should be noted, however, that a flow may 
also be referenced merely by a source and destination identifier. 

According to various embodiments, frames in a particular flow may be 
blocked because a buffer 403 is full. If an alternative route is provided for frames in 

15 the same sequence, later frames may be able to traverse a fibre channel fabric more 
quickly than a frame remaining in a congested switch. In one implementation, later 
frames can be blocked along an alternative route to allow earlier frames to reach a 
destination first. Frames transmitted first are herein referred to as earlier frames 
while frames transmitted later by a source are referred to herein as later frames. 

20 Later frames can be blocked using several mechanisms. In one embodiment, the 
arbitrator 405 may simply not select frames of the sequence for transmission to 
external nodes. In another embodiment, the later frames may not be queued for 
transmit scheduling at all until a period of time has elapsed. 

25 The period of time can be determined in several manners. According to 

various embodiments, a fibre channel switch is configured to buffer a frame for no 
more than a fibre channel switch latency before the frame is dropped. It should be 
noted, however, that some switches may not drop a frame after a fibre channel 
switch latency has elapsed. In one embodiment, if the frame is held in a buffer in 

30 the switch for more than a latency period, a frame is dequeued from the virtual 
output queue and dropped. The fibre channel switch latency may depend on 
switching speeds and network congestion. The amount of time the frame can spend 
in a fibre channel switch before being dropped is referred to herein as the fibre 
channel switch latency. 

10 
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The amount of time a frame can spend in a fibre channel network before 
being dropped is referred to herein as the fibre channel fabric drain latency or 
network drain latency. According to various embodiments, the fibre channel fabric 
5 drain latency or network drain latency is calculated by multiplying the fibre channel 
switch latency by the maximum number of hops it takes for a frame to traverse the 
fibre channel fabric. A wide variety of techniques for determining fibre channel 
switch latency and fibre channel fabric drain latency are contemplated. 

10 Figure 5 is a process flow diagram showing the forwarding of frames 

arriving from a connected host or disk. At 501, a fibre channel fabric link change is 
detected. As noted above, the fibre channel fabric link change can lead to the out of 
order delivery of a frame sequence to a fibre channel device. According to various 
embodiments, A fibre channel fabric link change can be detected based on the 

15 receipt of link update messages or the transmission of link update messages. At 
503, updated sets of next hops are generated for each destination based on the new 
link state information. The updated set of next hops can be generated using 
algorithms such as Fibre Channel Shortest Path First (FSPF). With new information 
about network topology, a switch may better be able to determine what is the best 

20 path for transmitting a frame to a particular destination. At 505, it is determined for 
each destination whether the set of next hops is equal to the updated set of next 
hops. If it is determined at 507 that all sets of next hops are equal to the 
corresponding updated sets of next hops, no action is taken. 

25 For example, if the original path for transmitting a frame to a destination was 

through switches 103 and 107 while the new path for transmitting a frame to a 
destination is also through switches 103 and 107, no action is necessary. However, 
if the updated path for transmitting a frame to a destination is 105 and 107, the 
updated set of next hops is not equivalent to be original set of next hops. Blocking 

30 and dropping mechanisms can be applied At 509, for each destination with a set of 
next hops different from the corresponding updated set of next hops, the queues or 
virtual output queues for transmission of a frame having that destination are 
blocked. According to various embodiments, blocking the virtual output queue may 
entail not transmitting the frame to a next hop. At 511, the routing table is updated 
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using the updated sets of next hops. The queues with a change in the set of next 
hops can then be blocked during a fabric drain latency or fabric drain period at 513. 

The fabric drain latency allows time for earlier packets that may still remain 
5 at congested network switches to either be delivered or dropped from the network. 
Different fabric drain periods may be associated with each virtual output queue or a 
single fabric drain period can be applied to all the virtual output queues. After a 
fabric drain period has elapsed, the blocked queues are unblocked at 515. Later 
frames in the blocked queues can now be transmitted since earlier frames were 
10 either dropped or delivered to the destination. Later frames can be transmitted to a 
destination without the risk that earlier transmitted frames are still in the network 
waiting to arrive at the destination out of order. 

The techniques described in Figure 5 can be applied at any port in a fibre 
15 channel fabric on a per VSAN, per destination basis. According to various 
embodiments, however, the techniques depicted in Figure 5 are applied at an edge- 
ports connecting a switch to a host or disk. The techniques for edge-ports focus on 
blocking traffic to avoid injecting traffic which will potentially be dropped. 

20 Figure 6 is a process flow diagram showing the forwarding of frames 

between fibre channel switches. At 601, the fibre channel fabric link change is 
detected. At 603, updated sets of next hops are calculated for each destination. At 
605, it is determined for each destination if the set of next hops is equal to the 
updated set of next hops. If all sets of next hops are equal to the corresponding 

25 updated sets of next hops at 607, no action needs to be taken. If the sets of next 
hops are not equal to all the corresponding updated sets of next hops, frames having 
destinations associated with a changed set of next hops are dropped at 609. 

The routing table can be updated at 611 and the fibre channel switch waits 
30 for a corresponding fabric drain period associated with the virtual output queue at 
613. After the fabric drain period has elapsed for a particular queue, frames that 
would be placed in the queue associated with a fabric drain period are now 
forwarded instead of being dropped. 
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As noted above, the techniques of the present invention can apply towards 
the in order delivery of fibre channel frames in a fibre channel fabric upon detection 
of a link change. Typically, when a new node or new link is added or an old node 
or an old link is subtracted from the fibre channel topology, there is a risk of out of 
5 order delivery of fibre channel frames. 

However, changes in network topology are not the only events that can 
trigger out of order delivery. Changes in a channel between two switches can also 
lead to out of order frame delivery, 

10 

Figure 7 is a diagrammatic representation showing a channel change that can 
cause out of order delivery. Switches 107 and 109 may be interconnected originally 
through redundant links 701, 703, and 705 that form a channel 709. Traffic 
traveling between switches 107 and 109 may be distributed across the different links 
15 in Hie channel 709 based on factors such as fairness and load balancing. In another 
example, identical hash functions can be used at the transmitting and receiving 
switches to determine what link to access next Using identical hash functions can 
provide for in order delivery. 

20 When a new link 707 is added to form an updated channel 711, more links 

are now available for transmission between switch 107 and switch 109. It should be 
noted, however, that no change in network topology occurs. Switches 107 and 109 
are still connected in the fibre channel fabric. According to various embodiments, 
no changes or updates to routing tables are necessary when a channel changes. The 

25 routing table may simply instruct that a frame be forwarded from switch 107 to 
switch 109. The forwarding from switch 107 to switch 109 is not affected by the 
addition of a new link 707 to the original channel 709 to form the updated channel 
711. 

30 However, adding a link to a channel can also cause out of order delivery of 

fibre channel frames. Using the example above where frames with sequence 
numbers 1-6 are being transmitted on link 701, 703, or 705, a frame with sequence 
numbers 7 and 8 may be transmitted along link 707. A link 707 may be an 
uncongested or a higher bandwidth link that allows the frames with sequence 
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numbers 7 and 8 to leave switch 109 before the frames 1-6 arrive. The techniques 
of the present invention provide for the blocking and dropping fibre channel frames 
to allow in order delivery to switches and nodes downstream of switch 109. 
Blocking is also herein referred to as delaying. 

5 

As noted above, when a channel changes, the routing table may not 
necessarily change as a link still exists between switch 107 and switch 109. Logic 
and mechanisms for determining what links are available in a channel for 
transmitting a frame are referred to herein as forwarding channel tables. It should 
10 be noted that forwarding channel tables may change. Figure 8A shows a forwarding 
channel table for a switch 107. For a next hop 109, entry 803 provides that links 
701,703, and 705 can be used to transmit a frame from switch 107 to the next hop 
109. 

15 Figure 8B shows a forwarding channel table for a switch 107 after a change 

in channel. After a change in the channel occurs, entry 813 provides information 
that links 701,703, 705, and 707 can be used to transmit from a switch 107 to a next 
hop 109. 

20 Figure 9 is a process flow diagram showing the forwarding the frames 

during a channel change, according to various embodiments. At 901, a switch 
detects a change in the channel. The addition or subtraction of one or more links 
can be a change in the channel. At 903, traffic directed toward the channel based on 
the routing table next hop is dropped. In one embodiment, traffic destined for 

25 transmission through the channel but not yet queued is dropped to allow changes in 
the forwarding channel tables to accommodate the channel change. However, 
traffic already queued remains. As noted above, the traffic already queued may 
reside in virtual output queues associated with particular flows. At 905, the queues 
for transmitting along the links in the channel are marked and blocked. In one 

30 example, marking and blocking the queues includes allowing all frames already in 
the queues to be transmitted while blocking any new frames for the switch drain 
latency time period. 
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At 907, a forwarding channel table can then be updated to add and/or 
remove links in the channel. At 909, traffic heading toward the channel that was 
previously dropped at 903 can now be allowed into the queues. A switch can then 
wait at 911 during a modified fibre channel switch drain latency. According to 

5 various embodiments, a modified fibre channel switch drain latency is longer than 
the standard fibre channel switch drain latency to allow all frames already in the 
queues to be transmitted instead of dropped. At 913, queues associated with the 
links in the channel are released and all frames in the queues are either transmitted 
or dropped if the frame is too old and new frame sequences can now be delivered in 

10 order. 

Although mechanisms such as delaying frames can be used according to 
various embodiments, the techniques of the present invention also contemplate 
using labels to provide for in order delivery of fibre channel frames. One of the 

15 many reasons for using labels is that labels provide a fast mechanism for accessing 
entries in a routing table. Instead of looking at a destination address, the destination 
identifier can be an "in label" or an incoming label that can be used to quickly 
reference a routing table entry. Labels can be used for a variety of other reasons as 
well. Using labels in fibre channel networks is described in concurrently filed U.S. 

20 application No. 10/114,394, titled Label Switching In Fibre Channel Networks 
(Attorney Docket No. ANDIP009) by inventors Scott S. Lee and Dinesh G. Dutt, 
the entirety of which is incorporated by reference for all purposes. One example of 
an architecture for implementing label switching is Multiple Protocol Label 
Switching (MPLS) described in RFC 3031, the entirety of which is also 

25 incorporated by reference for all purposes. 

Figure 10 is a diagrammatic representation of a fibre channel network 
including labels switching routers that can forward packets based on labels 
associated with frames. In addition to containing the destination address, a frame 
30 includes as a destination identifier an in label also referred to herein as an incoming 
label that allows a switch to quickly access an entry in a routing table. For example, 
a label switching router 1004 can receive a frame with a destination of 2 and an in 
label of 420. It should be noted that in Figures 10-12, a destination or a next hop of 
1-5 refers to a switch 1001-1005. The label switching router 1004 can access its 
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routing table 1014 to recognize that the next hop is label switching router 1002 and 
the "out label" should be 220. The out label is also referred to herein as an outgoing 
label. According to various embodiments, the label switching router 1004 replaces 
the frame label value of 420 corresponding to the in label in the routing table with a 
5 frame label of 220 corresponding to the out label in the routing table 1014. 

By replacing the label value, the label switching router 1004 provides label 
information to the next hop router 1002, to allow the label switching router 1002 to 
similarly access a routing table entry quickly. It should be noted that although label 
10 switching can be provided for fast access of entries in a routing table, label 
switching can be used for a variety of reasons. The techniques of the present 
invention provide that frames can be delivered in order by using labels. 

When a label switching router 1002 receives a frame from label switching 
15 router 1004, the label switching router uses the label 220 to access an entry in the 
routing table 1012. Using the in label 220, the label switching router 1002 
recognizes that the frame has reached the ultimate hop switch and no longer needs 
to be forwarded to another switch. The frame can then be forwarded to the ultimate 
destination which may be a host or a disk. 

20 

A variety of techniques can be used for generating routing tables with labels. 
In one embodiment, routing tables are generated upon the receipt of link state 
update packets under the FSPF protocol. Routing tables can be generated 
periodically or upon the identification of a change in link state. According to 

25 various embodiments, a newly generated routing table is associated with an 
incarnation number. A combination of all the incarnation numbers in a fibre 
channel fabric is herein referred to as a topology version number. In one 
embodiment, every time a new routing table is generated at a switch, the incarnation 
number is incremented by one. According to various embodiments, each label 

30 switching router in a fibre channel network not only generates new forwarding 
routes toward each destination, but each label switching router also generates new in 
labels different from the previous set of in labels. 



16 



WO 03/085910 



PCT/US03/09328 



Figure 1 1 is a diagrammatic representation showing label switching routers 
during a change in link state. Here, the link between label switching router 1004 
and label switching router 1001 is no longer available. Link state update packets or 
link state records are propagated throughout the fibre channel fabric. To transmit to 

5 a label switching router 1001 from a label switching router 1004, label switching 
router 1004 can no longer forward directly to label switching router 1001. Instead, 
label switching router 1004 sends frames towards either label switching router 1002 
or label switching router 1003. The new routes are reflected in the routing table 
1 1 14. The label switching router 1004 generates a new routing table 1 114 with the 

10 new set of next hops and generates a new in label 41 1 replacing the old in label 410. 

According to various embodiments, the label switching router 1004 sends 
label switching control messages to the other label switching routers in the fibre 
channel fabric in order to remove the outdated 410 in label. The label switching 

15 control messages sent to other label switching routers in the fibre channel fabric 
include a topology version number. In one embodiment, the other label switching 
routers verify that the topology version number of the label switching control 
messages matches the topology version number of the routing tables. If the 
topology version number of the label switching control message does not match the 

20 topology version number of the routing table in a particular label switching router or 
if the control message contains an older version number, the label switching control 
message is discarded. 

If the topology version number of the label switching control message is 
25 newer than the topology version number of the routing table in a particular label 
switching router, the label switching control message can be stored and used later 
when the newer version of the routing table is available. 

If the topology version numbers match, the label switching router receiving 
30 the label switching control message can remove the outdated label. For example, if 
a label switching router 1004 generates a new in label 411 replacing an old in label 
410 and transmits a label switching control message to withdraw the outdated label 
to a label switching router 1002, label switching router 1002 will remove the out 
label 410 associated with a destination ID of 1 and a next hop of 4. According to 
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various embodiments, the label switching control message to remove an outdated 
label is referred to herein as a label withdraw message. 

While out labels are unresolved, fibre channel frames are dropped to prevent 
5 out of order delivery of fibre channel frames. In Figures 1 1 and 12, a "?" is used to 
indicate that a label is unresolved. For example, suppose label switching router 
1005 transmitted frames 1 and 2 to label switching router 1001 in order to 
eventually reach label switching router 1004. After the failure of the link between 
label switching router 1001 and label switching router 1004, label switching router 
10 1005 transmitted frames 3 and 4 to label switching router 1002 in order to 
eventually reach label switching router 1004. If frames 1 and 2 are not dropped, 
they could arrive at label switching router 1004 after frames 3 and 4, However, 
because the out label at label switching router 1001 for destination label switching 
router 1004 is not resolved after the link failure, frames 1 and 2 are dropped and 
15 frames 3 and 4 can arrive in order at label switching router 1004. In other words, 
because the out label 440 associated with destination 4 in the routing table 1 1 1 1 is 
no longer accurate, frames 1 and 2 can be dropped. 

To resolve the out labels, label switching router 1004 advertises the new in 
20 labels including in label 411 to other label switching routers such as label switching 
router 1002. The label switching control message to add a new label is referred to 
as a label mapping message. In one example, label switching router 1004 sends a 
label mapping message having a label to label switching router 1002. The label 
mapping message received at label switching router 1002 can instruct the label 
25 switching router 1002 to use the new in label 411 as the new out label associated 
with the next hop of 4 and the destination of 1. 

Figure 12 is a diagrammatic representation showing label switching routers 
in a fibre channel fabric where out labels are not yet completely resolved. Label 
30 switching router 1004 has generated a routing table 1214 that has replaced an old set 
of in labels 410, 420, 430, 440, and 450 with an updated stack of in labels 411, 421, 
431, 441, and 451 respectively. The label switching router 1004 has also resolved 
its out labels and has replaced out labels 1 10, 110, 220, 330, 250, and 350, with new 
out labels 211, 311, 221, 331, 251, and 351, respectively. According to various 
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embodiments, out labels are resolved at label switching router 1004 after the router 
1004 has received from all other routers label mapping messages with topology 
version numbers corresponding to that of the routing table 1214. 

5 Label switching router 1002, however, has not fully result out labels in 

routing table 1212. Although the routing table 1212 contains a new set of in labels, 
the set of out labels his only partially resolved. In particular, although the label 
switching router 1002 has new out labels corresponding to the next hop of 4, it does 
not have new out labels corresponding to the next hop of 5. This may have resulted 

10 from the receipt of a label mapping message from label switching router 1004 but 
no label mapping message from label switching router 1005. If the label switching 
router 1002 receives a frame for transmission to a next hop of 5, the frame is 
dropped because the out labels associated with the next hop of five are unresolved. 

15 As link state update messages and label switch control messages are passed 

throughout the fibre channel fabric, every switch eventually can resolve the out 
labels in the same manner label switching router 1004 and label switching router 
1001 have resolved the out labels in Figure 12. 

20 Figure 13 is a flow process diagram showing the forwarding of frames using 

labels. At 1301, a fibre channel fabric link changes detected. At 1303, updated sets 
of next hops for each destination are calculated based on link state information. 
According to various embodiments, the topology version number is identified at 
1305. The topology version number can be a combination of the incarnation 

25 numbers of all of the switches in a fibre channel fabric. In one embodiment, the 
topology version number includes all of the incarnation numbers appended together. 
In another embodiment, the topology version number is a check sum of the various 
incarnation numbers. And in still another embodiment, the topology version 
number is a unique number meaningful to the switches in the network. At 1307, 

30 switches can transmit label switching control messages to withdraw previously 
advertised labels. At 1309, each label switching router can generate new in or 
incoming labels and advertise the new in or incoming labels to the other switches in 
the fabric. 
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It should be noted that the techniques of the present invention do not 
necessarily have to be performed in any particular order. For example, in one 
embodiment a label switching router can generate and advertise an in labels at the 
same time out labels are withdrawn at the other label switching routers. 

5 

At 1311, frames with old labels are dropped. Frames are also dropped while 
out labels are not resolved. For example, if the out labels associated with a next hop 
of six are not resolved, frames configured for transmission to a next hop of six are 
dropped. At 1313, out labels are resolved by using information from the label 
10 mapping messages or information from advertisements from the other label 
switching routers. Frames no longer are dropped, as out labels are resolved at 1315. 

While the invention has been particularly shown and described with 
reference to specific embodiments thereof, it will be understood by those skilled in 

15 the art that changes in the. form and details of the disclosed embodiments may be 
made without departing from the spirit or scope of the invention. For example, 
embodiments of the present invention may be employed with a variety of network 
protocols and architectures. Instructions such as quench messages can be sent at a 
variety of different times. It is therefore intended that the invention be interpreted to 

20 include all variations and equivalents that fall within the true spirit and scope of the 
present invention. 
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CLAIMS 

What is claimed is: 

1. A method for selectively delivering fibre channel frames in a fibre 
5 channel fabric, the method comprising: 

identifying a set of next hops at a fibre channel entity, wherein the set of 
next hops is used to forward frames based on a destination identifier; 
detecting a fibre channel fabric link change; 

identifying an updated set of next hops, wherein the updated set of next hops 
10 is used to forward a frame received at a fibre channel entity based on a destination 
identifier while accounting for the fibre channel fabric link change; 

comparing the set of next hops to the updated set of next hops; and 
preventing the forwarding of frames towards the updated set of next hops for 
a predetermined period of time if it is determined that the set of next hops is 
15 different from the updated set of next hops. 

2. The method of claim 1 , wherein the predetermined period of time is a 
fibre channel fabric drain period. 

3 . The method of any of claims 1-2, wherein preventing the forwarding 
of frames towards the updated set of next hops comprises: 

20 blocking frames scheduled for forwarding to the updated set of next hops. 

4. The method of claim 3, wherein the frames are blocked in virtual 
output queues associated with the updated set of next hops. 

5 . The method of claim 3, further comprising 

updating a routing table to forward frames to the updated set of next hops. 
25 6. The method of claim 5, wherein the fibre channel entity is a fibre 

channel switch comprising a plurality of routing tables, wherein each routing table 
is associated with a particular virtual storage area network. 

7. The method of claim 5, further comprising: 

allowing the forwarding of frames to the updated set of next hops after the 
30 predetermined period of time has elapsed and the routing table has been updated to 
schedule frames for forwarding to the updated set of next hops. 

8. The method of claim 7, further comprising: 

dropping frames that have been held in the switch for more than a fibre 
channel switch drain period. 
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9. The method of claim 8, further comprising: 

determining that the fibre channel switch is associated with a non edge-port 
before preventing the forwarding of frames to the updated set of next hops. 

10. The method of any of claims 1-9, wherein preventing the forwarding 
5 of frames towards the updated set of next hops comprises: 

dropping frames scheduled for forwarding to the updated set of next hops, 

1 1 . The method of claim 1 0, further comprising 

updating a routing table to forward frames to the updated set of next hops. 

12. The method of claim 11, wherein the fibre channel entity is a fibre 
10 channel host comprising a plurality of routing tables, wherein each routing table is 

associated with a particular virtual storage area network. 

1 3 . The method of claim 1 1 , further comprising: 

allowing the forwarding of frames to the updated set of next hops after the 
predetermined period of time has elapsed and the routing table has been updated to 
15 schedule frames for forwarding to the updated set of next hops. 

14. The method of claim 13, further comprising: 

determining that the fibre channel host is associated with a edge-port before 
preventing the forwarding of frames to the updated set of next hops. 

15. A method for selectively delivering frames in a fibre channel fabric, 
20 the method comprising: 

identifying a channel for forwarding frames from a first fibre channel entity 
to a second fibre channel entity, wherein the channel comprises a plurality of links 
connecting the first fibre channel entity to the second fibre channel entity; 

detecting a change in the channel at the first fibre channel entity; 
25 identifying an updated channel for forwarding frames from the first fibre 

channel entity to the second fibre channel entity, wherein the updated channel is 
different from the channel; and 

dropping frames received for forwarding on the updated channel that have 
not yet been placed into output queues associated with the updated channel. 
30 16. The method of claim 1 5, further comprising: 

marking the output queues associated with the links in the updated channel 
to block all frames after the frames already in the queues are delivered. 

17. The method of claim 16, wherein dropping frames comprises 
updating a forwarding channel table to forward frames to a drop adjacency. 
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1 8. The method of claim 16, wherein the output queues are virtual output 

queues. 

19. The method of claim 16, further comprising: 

updating a forwarding channel table to forward frames on the updated 
5 channel. 

20. The method of claim 19, further comprising: 

waiting for a period of time derived using the fibre channel switch latency. 

2 1 . The method of claim 1 9, further comprising: 

waiting for a period of time greater than the fibre channel switch latency. 
10 22. The method of claim 20, further comprising: 

releasing the output queues to allow forwarding of frames on the updated 
channel after the period of time has elapsed and the forwarding channel table has 
been updated to forward frames on the updated channel. 

23. The method of claim 22, wherein the forwarding* channel table is 
1 5 updated using a hash modulo. 

24. A method for selectively delivering frames in a fibre channel fabric, 
the method comprising: 

detecting a fibre channel fabric link change at a fibre channel switch; 

generating an updated routing table associated with a topology version 
20 number, wherein generating an updated routing table comprises determining a next 
hop, an incoming label, and a destination corresponding to each entry in the updated 
routing table; 

receiving a frame at the fibre channel switch, the frame having a first 
destination and a first label corresponding to a first entry in the updated routing 
25 table; 

detemiining whether the fibre channel switch has received a first outgoing 
label corresponding to the first entry in the updated routing table, the received first 
outgoing label having the same topology version number as the updated routing 
table; and 

30 dropping the frame if it is determined that the fibre channel switch has not 

received the first outgoing label. 

25. The method of claim 24, wherein the first label is equivalent to a first 
incoming label associated with the first entry. 

26. The method of any of claims 24-25, further comprising: 
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forwarding the frame to the next hop if it is determined that the fibre channel 
switch has received the first outgoing label. 

27. The method of claim 26, wherein the routing table is associated with 
a particular virtual storage area network. 
5 28. The method of claim 27, wherein the topology version number is 

derived using incarnation numbers with each switch in the virtual storage area 
network. 

29. The method of claim 28, wherein the input and outgoing labels are 
MPLS labels. 

10 30. The method of claim 29, wherein the first label is contained in the 

EISL header associated with the fibre channel frame. 

31. The method of claim 30, wherein the first label in the frame is 
contained in the fibre channel frame header. 

32. The method of any of claims 25-32, further comprising: 

15 receiving a first link state control message having the first outgoing label and 

a topology version number corresponding to the topology version number of the 
routing table. 

33. The method of claim 32, further comprising: 

advertising the first incoming label to other switches in the fibre channel 

20 fabric. 

34. A fibre channel entity for selectively delivering fibre channel frames 
in a fibre channel fabric, the method comprising: 

means for identifying a set of next hops at a fibre channel entity, wherein the 
set of next hops is used to forward frames based on a destination identifier; 
25 means for detecting a fibre channel fabric link change; 

means for identifying an updated set of next hops, wherein the updated set of 
next hops is used to forward a frame received at a fibre channel entity based on a 
destination identifier while accounting for the fibre channel fabric link change; 

means for comparing the set of next hops to the updated set of next hops; 

30 and 

means for preventing the forwarding of frames towards the updated set of 
next hops for a predetermined period of time if it is determined that the set of next 
hops is different from the updated set of next hops. 
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Figure 5 
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Figure 6 
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Figure 8A 
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Figure 9 
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